Abstract

The notions of real and user cardinality of a sign are introduced. Rank dis- 
tributions can be extended to arbitrary sign objects, i.e., semiotic systems. The 
dynamics of the distribution of consumer durables, such as automobiles, is studied. 

Usually, in semiotics only relatively short strings of signs (discourses) are
considered, while long strings with large parameters have not really been studied. We
shall introduce notions generalizing the notion of participant in communicative and
noncommunicative sign systems: instead of the terms "narrateur" and "narrataire," or
"interlocuteur" and "interlocutaire" (see [1, p. 508]), we shall use the pair of terms generator
and user.

Semiotic objects, i.e., signs, can be of different types [2].

A word of a natural language is a sign. The collection of words is the dictionary 
of signs. We use the term dictionary of signs rather than "alphabet of signs" to stress 
that the number of signs can be very large. The activity index of a sign is the number 
of its occurrences. We shall call this index the real cardinality of the sign. The real 
cardinality of the dictionary of signs is the total number of occurrences of all the signs from 
the dictionary (collection of signs). In language systems, the cardinality of a dictionary 
(collection of words) corresponds to the number of occurrences of the words in the corpus 
of texts used to compile the dictionary. 
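
The definitions above can be illustrated by a minimal sketch that counts occurrences in a toy corpus. The corpus string and the helper name `real_cardinalities` are hypothetical and serve only as an example; they do not come from the paper.

```python
from collections import Counter

def real_cardinalities(tokens):
    """Map each sign (word) to its number of occurrences (its real cardinality)."""
    return Counter(tokens)

# Hypothetical toy corpus; any tokenized text would do.
corpus = "the sign is the word and the word is a sign".split()

cardinality = real_cardinalities(corpus)
dictionary_size = len(cardinality)                   # number of distinct signs
dictionary_cardinality = sum(cardinality.values())   # total number of occurrences

print(cardinality["the"], dictionary_size, dictionary_cardinality)
```

Here the number of distinct signs and the real cardinality of the dictionary are deliberately kept as two separate quantities, since the text distinguishes them.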

Let us now consider books in a bookstore, namely the entire collection of
books sold. Assume that each book has an inventory number. Each copy of a
book sold is a sign, and its value (price) is the real cardinality $\omega$ of this sign. The
additional money involved in the value of the book (storage expenses, overhead, etc.)
must be included to get the user cardinality $\tilde\omega$ of the book. Here the generator is the
group of accountants who determined the prices of books.

Now consider the catalog of books for sale. Each opus (be it a novel, a collection
of poems, or a textbook) is a sign, and its price is the real cardinality $\omega$ of the given sign.
The user's valuation of this sign is the user cardinality $\tilde\omega$. In this situation, unlike the
previous example, all the sold copies of the same opus are grouped together under
one sign, which is specified by the title listed in the catalog (this notion is similar to that
of descriptor in linguistics).

Each article of law in a book of statutes is a sign. The entire list of laws is the
dictionary of signs. Let us note that, in specific examples, it is easy to misinterpret
the notion of sign and its cardinality. For instance, in the given example, the number of
people who were arrested under the given article of the law is not, as one might think, the
real cardinality of this sign (article), and the number of people who actually broke this
article of law (whether they were arrested or not) is not its user cardinality. The person
(or persons) who created the book of statutes is not the generator of the signs. Similarly,
in linguistics, the number of word forms that constitute a lexeme is not its cardinality. Word
forms are actually signs at a lower level of the hierarchy.

The real cardinality $\omega$ of an article of law, regarded as a sign, is the corresponding fine or
the length of the prison term. The user cardinality $\tilde\omega$ also includes all the unpleasantness
related to the punishment (the quality of one's CV, separation from relatives, etc.). Here
the generators are the lawmakers who specified the punishment for breaking the law.

People sent to prison for breaking the law may also be regarded as signs (the inmates
are even given serial numbers). The cardinality $\omega$ is the length of the prison term. The
generators in this case are the lawmakers, the judges, the prosecuting attorneys, etc.

The sale of various goods will be considered below. Each type of goods is a sign,
and its price is its cardinality. The generator is the person who fixed the prices. The set of
all purchased types of goods is the dictionary of signs, the prices are the cardinalities, and the
customer is the user.

In the last two examples, it is easy to confuse the sign with its cardinality. The user 
cardinality in these examples is quite realistic for the customers, say for those who are 
buying cars. Thus to obtain the user cardinality (price) of an automobile, one must add 
to its list price the actual expenses related to its upkeep, storage, insurance, spare parts, 
etc. 
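
As a minimal sketch of this bookkeeping, the user cardinality of a car can be computed as the list price plus the additional expenses. All figures and the function name `user_cardinality` below are hypothetical and are used for illustration only.

```python
def user_cardinality(list_price, extra_costs):
    """List price (real cardinality) plus the additional expenses borne by the user."""
    return list_price + sum(extra_costs.values())

# Hypothetical figures, for illustration only.
extras = {"upkeep": 1200.0, "storage": 300.0, "insurance": 900.0, "spare_parts": 400.0}
total = user_cardinality(20000.0, extras)
print(total)
```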

Similarly, in the case of judicial punishment, the cardinality related to the actual
losses for the prisoner (a spoiled CV, alienation from the family, etc.) becomes quite
real for the other people involved: for some the prisoner becomes an outcast, while
for others, in some cases, he becomes a hero.

In such cases, the user cardinality is not a monotone function of the real cardinality.
For cheap cars, it increases as the real cardinality decreases; for instance, the
insecurity of the car becomes greater when its price decreases. Similarly, as prison
terms decrease, beginning at some level, the related negative consequences do not decrease,
and in fact increase relative to the real cardinality.

The generator should take into consideration the priorities, the tastes, the possibilities,
and so on of the user. If the generator does not do this to a sufficient extent, then the
experimental curve will not approximate the theoretical curve as well as it does in the
automobile example shown below in Figures 1 and 2.

For instance, suppose that the generator (the lawmakers) does not take into consideration the
mentality of the given "user" and compiles a set of laws under which practically any citizen
constantly breaks the law. Since it is impossible to imprison everyone, the system
starts putting in jail only those citizens whom those in power dislike for some reason, and this
leads to a totalitarian state where everyone lives in fear. In this case, the experimental
curves will not fit the theoretical ones, because the absence-of-preference principle
(see [5]), on which the theoretical curves are based, no longer applies.

Let us pass to the description of our main approach to the general class of semiotic
objects.

The most important and difficult question is how the generator works out the
cardinalities of the signs in the dictionary. These cardinalities are worked out via a system of
"agreements" between the generator and the user. The generator "produces a fictional
action which places him at a higher level as compared to" the user.

If the generator, having recently passed the bar exam, begins to impose an ideal system
of laws on the user, it will be rejected because it does not satisfy the social "rules of the
game," and the user will start reintroducing lynch law or resorting to the laws of the
mafia.

Let us look at another, even more spontaneous, generator, which must include a huge
number of people: it is impossible to specify how and by whom the cities and towns of a
country were founded. The interaction with neighbors, the greater
security in numbers, the role of commerce: all these factors must be included in a very
complicated and long algorithm.

Our considerations are based on Kolmogorov's approach to randomness as maximal 
complexity (now known as Kolmogorov complexity, see [4]). This means that the longer 
the algorithm used by the generator to construct the collection of signs and their cardi- 
nalities, the nearer will the result be to the general position of the majority of all possible 
versions of these collections. This is similar to the fact that in playing "heads or tails,"
the larger the number of trials, the nearer will the sequence of heads and tails be to the
"generic" version, in which in half the trials we get heads, and tails in the other half. And
for most of the possible strings, we can apply the theorem from [3].
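
The "heads or tails" remark can be checked by direct counting. The following minimal sketch computes, for a given number of trials, the exact fraction of all sequences whose share of heads is close to one half. The function name `fraction_near_half` and the tolerance of 0.1 are assumptions made for this illustration only.

```python
from math import comb

def fraction_near_half(n, tol=0.1):
    """Fraction of all 2**n heads/tails sequences whose share of heads
    lies within `tol` of one half (exact binomial count)."""
    count = sum(comb(n, k) for k in range(n + 1) if abs(k / n - 0.5) <= tol)
    return count / 2**n

f10, f100, f1000 = (fraction_near_half(n) for n in (10, 100, 1000))
print(f10, f100, f1000)
```

The fraction grows toward 1 as the number of trials increases, which is the sense in which most long sequences are "generic."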

Indeed, let $N_i$ be the number of signs of the same real cardinality $\omega_i$, while $\tilde\omega_i$ is the
user cardinality. We denote the whole user energia by

$$\mathcal{E} = \sum_{i=1}^{s} N_i \tilde\omega_i.$$

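We can assume that the number of signs $N_i$ corresponding to the given user cardinality
$\tilde\omega_i$ of the sign $s_i$ is a random variable with equiprobable distribution for any collection
$\{N_i\}$ satisfying

1)

$$\sum_{i=1}^{s} N_i \tilde\omega_i \le \mathcal{E} \quad\text{if}\quad \mathcal{E} \le \frac{N}{s}\sum_{i=1}^{s}\tilde\omega_i,$$

and 2)

$$\sum_{i=1}^{s} N_i \tilde\omega_i \ge \mathcal{E} \quad\text{if}\quad \frac{N}{s}\sum_{i=1}^{s}\tilde\omega_i < \mathcal{E} < \tilde\omega_{\max} N.$$

Obviously, $\mathcal{E} \le \tilde\omega_{\max} N$, where $N$ is the length of the dictionary of signs.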

This axiom should be understood in the sense that the given string of signs of
energia $\mathcal{E}$ is one of many such strings with energia not greater than $\mathcal{E}$, possessing the
same dictionary of signs; here we assume that, at least for most of the signs, the
energia is in general position with respect to all possible versions of the collection $\{N_i\}$,
provided the latter satisfies conditions 1) or 2).

Case 1) was proved in [5]. We present below the proof of case 2).

As in [6], the values $\omega_1, \dots, \omega_s$ of the random variable are ordered in absolute value.
In our consideration, both the number of trials $N$ and $s$ tend to infinity.
Let $N_i$ be the number of "appearances" of the value $\omega_i$, where $\omega_i < \omega_{i+1}$; then

$$\sum_{i=1}^{s} \frac{N_i}{N}\,\omega_i = M,$$

where $M$ is the mathematical expectation.

The cumulative probability $\mathcal{P}_k$ is the sum of the first $k$ probabilities in the sequence
$\omega_i$: $\mathcal{P}_k = \frac{1}{N}\sum_{i=1}^{k} N_i$, where $k < s$. We denote $N\mathcal{P}_k = B_k$.
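
A minimal numerical sketch of these quantities follows. The values and counts below are hypothetical and chosen only so that the arithmetic is easy to follow; the variable names are likewise assumptions of this example.

```python
from itertools import accumulate

# Hypothetical data: values omega_i in increasing order and their counts N_i.
omega = [1.0, 2.0, 3.0, 4.0]
counts = [4, 3, 2, 1]

N = sum(counts)                                     # number of trials
M = sum(n * w for n, w in zip(counts, omega)) / N   # mathematical expectation
B = list(accumulate(counts))                        # B_k = N_1 + ... + N_k
P = [b / N for b in B]                              # cumulative probability P_k = B_k / N

print(N, M, B, P)
```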

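If all the variants for which

$$\sum_{i=1}^{s} N_i = N \tag{2}$$

and

$$\sum_{i=1}^{s} N_i \omega_i \ge \mathcal{E} = MN > \bar\omega N, \tag{3}$$

where $\bar\omega = \frac{1}{s}\sum_{i=1}^{s}\omega_i$, are equivalent (equiprobable), then [3] the majority of the variants
will accumulate near the following dependence of the "cumulative probability"
$B_l\{N_i\} = \sum_{i=1}^{l} N_i$:

$$\sum_{i=1}^{l} N_i = \sum_{i=1}^{l} \frac{1}{e^{\beta'\omega_i - \nu'} - 1}, \tag{4}$$

where $\beta'$ and $\nu'$ are determined by the conditions

$$B_s = N, \tag{5}$$

$$\sum_{i=1}^{s} \frac{\omega_i}{e^{\beta'\omega_i - \nu'} - 1} = \mathcal{E}, \tag{6}$$

as $N \to \infty$ and $s \sim N$. By the condition ([3]) $\beta' < 0$.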
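
To make the role of the two parameters concrete, here is a minimal sketch, assuming that the inverse temperature is already given and that only the second parameter is determined from the normalization condition (the occupation numbers sum to $N$). Solving for both parameters at once would require a two-dimensional root finder, which is not attempted here. The function names, the sample values of $\omega_i$, and the choice $\beta = -0.5$ are all hypothetical.

```python
import math

def occupation(beta, nu, w):
    """Bose-Einstein-type occupation number 1 / (exp(beta*w - nu) - 1)."""
    return 1.0 / math.expm1(beta * w - nu)

def solve_nu(beta, omega, N, iters=200):
    """Bisection for nu such that the occupation numbers sum to N.
    Positivity requires beta*w - nu > 0 for every w, i.e. nu < min(beta*w)."""
    hi = min(beta * w for w in omega)   # the sum blows up as nu -> hi from below
    lo = hi - 1.0
    # Move lo down until the total occupation is at most N.
    while sum(occupation(beta, lo, w) for w in omega) > N:
        lo -= 2.0 * (hi - lo)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        total = sum(occupation(beta, mid, w) for w in omega)
        if total > N:
            hi = mid
        else:
            lo = mid
    return lo

# Hypothetical example with negative inverse temperature.
beta = -0.5
omega = [1.0, 2.0, 3.0, 4.0, 5.0]
N = 10
nu = solve_nu(beta, omega, N)
occ = [occupation(beta, nu, w) for w in omega]
B = [sum(occ[:l]) for l in range(1, len(occ) + 1)]   # cumulative sums B_l
E = sum(w * n for w, n in zip(omega, occ))           # resulting energy
print(nu, B[-1], E)
```

With a negative inverse temperature the occupation numbers increase with $\omega_i$, so the resulting energy exceeds the product of $N$ and the arithmetic mean of the $\omega_i$, in agreement with the range of energies considered in this case.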

We introduce the notation: $\mathcal{M}$ is the set of all sets $\{N_i\}$ satisfying conditions (2)
and (3); $\mathcal{N}\{\mathcal{M}\}$ is the number of elements of the set $\mathcal{M}$.

Theorem 1 Suppose that all the variants of sets $\{N_i\}$ satisfying the conditions (2)
and (3) are equiprobable. Then the number of variants $\mathcal{N}$ of sets $\{N_i\}$ satisfying
conditions (2) and (3) and the additional relation

$$\Bigl|\sum_{i=1}^{l} N_i - \sum_{i=1}^{l} \frac{1}{e^{\beta'\omega_i - \nu'} - 1}\Bigr| \ge N^{3/4+\varepsilon} \tag{7}$$

is less than $c_1 \mathcal{N}\{\mathcal{M}\}/N^m$ (where $c_1$ and $m$ are any arbitrary numbers, $l > \varepsilon N$, and $\varepsilon$ is
arbitrarily small).

Proof of Theorem 1. 

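Let $\mathcal{A}$ be a subset of $\mathcal{M}$ satisfying the condition

$$\Bigl|\sum_{i=l+1}^{s} N_i - \sum_{i=l+1}^{s} \frac{1}{e^{\beta\omega_i - \nu} - 1}\Bigr| \le \Delta; \qquad
\Bigl|\sum_{i=1}^{l} N_i - \sum_{i=1}^{l} \frac{1}{e^{\beta\omega_i - \nu} - 1}\Bigr| \le \Delta,$$

where $\Delta$, $\beta$, $\nu$ are some real numbers independent of $l$.
We denote

$$\Bigl|\sum_{i=l+1}^{s} N_i - \sum_{i=l+1}^{s} \frac{1}{e^{\beta\omega_i - \nu} - 1}\Bigr| = S_{s-l}; \qquad
\Bigl|\sum_{i=1}^{l} N_i - \sum_{i=1}^{l} \frac{1}{e^{\beta\omega_i - \nu} - 1}\Bigr| = S_l.$$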

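Obviously, if $\{N_i\}$ is the set of all sets of integers on the whole, then

$$\mathcal{N}\{\mathcal{M}\setminus\mathcal{A}\} = \sum_{\{N_i\}} \Bigl(\Theta\Bigl(\sum_{i=1}^{s} N_i\omega_i - \mathcal{E}\Bigr)\,
\delta_{(\sum_{i=1}^{s} N_i),\,N}\,\Theta(S_l - \Delta)\,\Theta(S_{s-l} - \Delta)\Bigr), \tag{8}$$

where $\sum N_i = N$.

Here the sum is taken over all integers $N_i$, $\Theta(x)$ is the Heaviside function, and $\delta_{k_1,k_2}$
is the Kronecker symbol.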

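We use the integral representations

$$\delta_{NN'} = \frac{e^{-\nu N}}{2\pi} \int_{-\pi}^{\pi} d\varphi\, e^{-iN\varphi} e^{\nu N'} e^{iN'\varphi}, \tag{9}$$

$$\Theta(y) = \frac{1}{2\pi i} \int_{-\infty}^{\infty} d\lambda\, \frac{1}{\lambda - i}\, e^{\beta y(1 + i\lambda)}. \tag{10}$$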

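Now we perform the standard regularization. We replace the first Heaviside function
in (8) by the continuous function

$$\Theta_\alpha(y) = \begin{cases}
0 & \text{for } \alpha > 1,\ y < 0,\\
1 - e^{\beta y(1-\alpha)} & \text{for } \alpha > 1,\ y \ge 0,\\
e^{\beta y(1-\alpha)} & \text{for } \alpha < 0,\ y < 0,\\
1 & \text{for } \alpha < 0,\ y \ge 0,
\end{cases}$$

where $\alpha \in (-\infty, 0) \cup (1, \infty)$ is a parameter, and obtain

$$\Theta_\alpha(y) = \frac{1}{2\pi i} \int_{-\infty}^{\infty} e^{\beta y(1 + ix)} \Bigl(\frac{1}{x - i} - \frac{1}{x - \alpha i}\Bigr)\,dx. \tag{11}$$

If $\alpha > 1$, then $\Theta(y) \le \Theta_\alpha(y)$.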

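Let $\nu < 0$. We substitute (9) and (10) into (8), interchange the integration and
summation, then pass to the limit as $\alpha \to \infty$ and obtain the estimate

$$\mathcal{N}\{\mathcal{M}\setminus\mathcal{A}\} \le \Bigl|\frac{e^{-\nu N + \beta\mathcal{E}}}{i(2\pi)^2} \int_{-\pi}^{\pi}
\Bigl[\exp(-iN\varphi) \sum_{\{N_j\}} \exp\Bigl\{-\beta\sum_{j=1}^{s} N_j\omega_j + (i\varphi + \nu)\sum_{j=1}^{s} N_j\Bigr\}\Bigr]\,d\varphi
\times \Theta(S_l - \Delta)\,\Theta(S_{s-l} - \Delta)\Bigr|, \tag{12}$$

where $\beta$ and $\nu$ are real parameters such that the series converges for them.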

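To estimate the expression on the right-hand side, we bring the absolute value sign
inside the integral sign and then inside the sum sign, integrate over $\varphi$, and obtain

$$\mathcal{N}\{\mathcal{M}\setminus\mathcal{A}\} \le \frac{e^{-\nu N + \beta\mathcal{E}}}{2\pi} \sum_{\{N_i\}}
\exp\Bigl\{-\beta\sum_{i=1}^{s} N_i\omega_i + \nu\sum_{i=1}^{s} N_i\Bigr\} \times \Theta(S_l - \Delta)\,\Theta(S_{s-l} - \Delta). \tag{13}$$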



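We denote

$$Z(\beta, N) = \sum_{\{N_i\}} e^{-\beta\sum_{i=1}^{s} N_i\omega_i}, \tag{14}$$

where the sum is taken over all $N_i$ such that $\sum_{i=1}^{s} N_i = N$,

$$\zeta_l(\nu, \beta) = \prod_{i=1}^{l} \xi_i(\nu, \beta); \qquad \zeta_{s-l}(\nu, \beta) = \prod_{i=l+1}^{s} \xi_i(\nu, \beta); \qquad
\xi_i(\nu, \beta) = \frac{1}{1 - e^{\nu - \beta\omega_i}}, \quad i = 1, \dots, s.$$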

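It follows from the inequality for the hyperbolic cosine $\cosh(x) = (e^x + e^{-x})/2$ for
$|x_1| \ge \delta$; $|x_2| \ge \delta$:

$$\cosh(x_1)\cosh(x_2) > \frac{e^{\delta}}{2} \tag{15}$$

that the inequality

$$\Theta(S_{s-l} - \Delta)\,\Theta(S_l - \Delta) \le e^{-c\Delta} \cosh\Bigl(c\sum_{i=1}^{l} N_i - c\phi_l\Bigr)
\cosh\Bigl(c\sum_{i=l+1}^{s} N_i - c\phi_{s-l}\Bigr), \tag{16}$$

where

$$\phi_l = \sum_{i=1}^{l} \frac{1}{e^{\beta'\omega_i - \nu'} - 1}; \qquad \phi_{s-l} = \sum_{i=l+1}^{s} \frac{1}{e^{\beta'\omega_i - \nu'} - 1},$$

holds for all positive $c$ and $\Delta$.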
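We obtain

$$\begin{aligned}
\mathcal{N}\{\mathcal{M}\setminus\mathcal{A}\} \le{}& e^{-c\Delta} \exp(\beta\mathcal{E} - \nu N) \times \\
&\times \sum_{\{N_i\}} \exp\Bigl\{-\beta\sum_{i=1}^{l} N_i\omega_i + \nu\sum_{i=1}^{l} N_i\Bigr\} \cosh\Bigl(\sum_{i=1}^{l} cN_i - c\phi_l\Bigr) \times \\
&\times \exp\Bigl\{-\beta\sum_{i=l+1}^{s} N_i\omega_i + \nu\sum_{i=l+1}^{s} N_i\Bigr\} \cosh\Bigl(\sum_{i=l+1}^{s} cN_i - c\phi_{s-l}\Bigr) = \\
={}& e^{\beta\mathcal{E}} e^{-c\Delta} \times \\
&\times \bigl(\zeta_l(\nu - c, \beta)\exp(-c\phi_l) + \zeta_l(\nu + c, \beta)\exp(c\phi_l)\bigr) \times \\
&\times \bigl(\zeta_{s-l}(\nu - c, \beta)\exp(-c\phi_{s-l}) + \zeta_{s-l}(\nu + c, \beta)\exp(c\phi_{s-l})\bigr).
\end{aligned} \tag{17}$$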

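Now we use the relations

$$\frac{\partial}{\partial\nu} \ln\zeta_l \Big|_{\beta=\beta',\,\nu=\nu'} \equiv \phi_l; \qquad
\frac{\partial}{\partial\nu} \ln\zeta_{s-l} \Big|_{\beta=\beta',\,\nu=\nu'} \equiv \phi_{s-l} \tag{18}$$

and the expansion of $\zeta_l(\nu \pm c, \beta)$ by the Taylor formula. There exists a $\gamma < 1$ such that

$$\ln(\zeta_l(\nu \pm c, \beta)) = \ln\zeta_l(\nu, \beta) \pm c\,(\ln\zeta_l)'_\nu(\nu, \beta) + \frac{c^2}{2}(\ln\zeta_l)''_\nu(\nu \pm \gamma c, \beta).$$

We substitute this expansion, use formula (18), and see that $\phi_l$ is cancelled.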



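Another representation of the Taylor formula implies

$$\ln(\zeta_l(\nu + c, \beta)) = \ln(\zeta_l(\beta, \nu)) + \frac{c}{\beta}\frac{\partial}{\partial\nu}\ln(\zeta_l(\beta, \nu)) +
\int_{\nu}^{\nu + c/\beta} d\nu'\,(\nu + c/\beta - \nu')\,\frac{\partial^2}{\partial\nu'^2}\ln(\zeta_l(\beta, \nu')). \tag{19}$$

A similar expression holds for $\zeta_{s-l}$.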

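From the explicit form of the function $\zeta_l(\beta, \nu)$, we obtain

$$\frac{\partial^2}{\partial\nu^2}\ln(\zeta_l(\beta, \nu)) = \beta^2 \sum_{i=1}^{l} \frac{\exp(-\beta(\omega_i + \nu))}{(\exp(-\beta(\omega_i + \nu)) - 1)^2} \le \beta^2 s d, \tag{20}$$

where $d$ is given by the formula

$$d = \frac{\exp(-\beta(\omega_s + \nu))}{(\exp(-\beta(\omega_s + \nu)) - 1)^2}.$$

The same estimate holds for $\zeta_{s-l}$.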

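Taking into account the fact that $\zeta_l\zeta_{s-l} = \zeta_s$, we obtain the following estimate for
$\beta = \beta'$ and $\nu = \nu'$:

$$\mathcal{N}\{\mathcal{M}\setminus\mathcal{A}\} \le \zeta_s(\beta', \nu') \exp\Bigl(-c\Delta + \frac{c^2}{2}\beta^2 s d\Bigr) \exp(\mathcal{E}\beta' - \nu' N). \tag{21}$$

Now we express $\zeta_s(\nu', \beta')$ in terms of $Z(\beta, N)$. To do this, we prove the following lemma.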
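Lemma 1 Under the above assumptions, the asymptotics of the integral

$$Z(\beta, N) = \frac{e^{-\nu N}}{2\pi} \int_{-\pi}^{\pi} d\alpha\, e^{-iN\alpha}\,\zeta_s(\beta, \nu + i\alpha) \tag{22}$$

has the form

$$Z(\beta, N) = C e^{-\nu N} \frac{\zeta_s(\beta, \nu)}{\bigl|\partial^2 \ln\zeta_s(\beta, \nu)/\partial\nu^2\bigr|^{1/2}} \Bigl(1 + O\Bigl(\frac{1}{N}\Bigr)\Bigr), \tag{23}$$

where $C$ is a constant.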
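We have

$$Z(\beta, N) = \frac{e^{-\nu N}}{2\pi} \int_{-\pi}^{\pi} e^{-iN\alpha}\,\zeta_s(\beta, \nu + i\alpha)\,d\alpha
= \frac{e^{-\nu N}}{2\pi} \int_{-\pi}^{\pi} e^{N S(\alpha, N)}\,d\alpha, \tag{24}$$

where

$$S(\alpha, N) = -i\alpha + \frac{1}{N}\ln\zeta_s(\beta, \nu + i\alpha) = -i\alpha - \frac{1}{N}\sum_{i=1}^{s} \ln\bigl[1 - e^{\nu + i\alpha - \beta\omega_i}\bigr]. \tag{25}$$

Here $S$ depends on $N$, because $s$, $\omega_i$, and $\nu$ also depend on $N$; the latter is chosen so that
the point $\alpha = 0$ is a stationary point of the phase $S$, i.e., from the condition

$$N = \sum_{i=1}^{s} \frac{1}{e^{\beta\omega_i - \nu} - 1}. \tag{26}$$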



We assume that $a_1 N \le s \le a_2 N$, $a_1, a_2 = \mathrm{const}$, and, in addition, $0 \le \omega_i \le B$ and
$B = \mathrm{const}$, $i = 1, \dots, s$. If these conditions are satisfied in some interval $\beta \in [0, \beta_0]$ of the
values of the inverse temperature, then all the derivatives of the phase are bounded, the
stationary point is nondegenerate, and the real part of the phase outside a neighborhood
of zero is strictly less than its value at zero minus some positive number. Therefore,
calculating the asymptotics of the integral, we can replace the interval of integration
$[-\pi, \pi]$ by the interval $[-\varepsilon, \varepsilon]$. In this integral, we perform the change of variable

$$z = \sqrt{S(0, N) - S(\alpha, N)}. \tag{27}$$

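This function is holomorphic in the disk $|\alpha| \le \varepsilon$ in the complex $\alpha$-plane and has a
holomorphic inverse for a sufficiently small $\varepsilon$. As a result, we obtain

$$\int_{-\varepsilon}^{\varepsilon} e^{N S(\alpha, N)}\,d\alpha = e^{N S(0, N)} \int_{\gamma} e^{-N z^2} f(z)\,dz, \tag{28}$$

where the path $\gamma$ in the complex $z$-plane is obtained from the interval $[-\varepsilon, \varepsilon]$ by the
change (27) and

$$f(z) = \Bigl(\frac{\partial\sqrt{S(0, N) - S(\alpha, N)}}{\partial\alpha}\Bigr)^{-1} \Big|_{\alpha = \alpha(z)}. \tag{29}$$

For a small $\varepsilon$ the path $\gamma$ lies completely inside the double sector $\mathrm{Re}(z^2) > c(\mathrm{Re}\,z)^2$ for some
$c > 0$; hence it can be "shifted" to the real axis so that the integral does not change up
to terms that are exponentially small in $N$. Thus, with the above accuracy, we have

$$Z(\beta, N) = \frac{e^{-\nu N} e^{N S(0, N)}}{2\pi} \int_{-\varepsilon}^{\varepsilon} e^{-N z^2} f(z)\,dz. \tag{30}$$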

Since the variable $z$ is now real, we can assume that the function $f(z)$ is compactly supported
(changing it outside the interval of integration), extend the integral to the entire axis (which again
gives an exponentially small error), and then calculate the asymptotic expansion of the
integral by expanding the integrand in the Taylor series in $z$ with a remainder. This justifies
the application of the saddle-point method to the above integral in our case.



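Lemma 2 The quantity

$$\frac{1}{\mathcal{N}(\mathcal{M})} \sum_{\{N_i\}} e^{-\beta\sum_{i=1}^{s} N_i\omega_i}, \tag{31}$$

where $\sum N_i = N$ and $\sum \omega_i N_i > \mathcal{E} + N^{1/2+\varepsilon}$, tends to zero faster than $N^{-k}$ for any $k$, $\varepsilon > 0$.

We consider the point of minimum in $\beta$ of the right-hand side of (17) with $\nu(\beta, N)$
satisfying the condition

$$\sum_{i=1}^{s} \frac{1}{e^{\beta\omega_i - \nu} - 1} = N.$$

It is easy to see that it satisfies condition (6). Now we assume that the assumption of the
lemma is not satisfied.

Then for $\sum N_i = N$, $\sum \omega_i N_i > \mathcal{E} + N^{1/2+\varepsilon}$, we have

$$e^{\beta\mathcal{E}} \sum_{\{N_i\}} e^{-\beta\sum_{i=1}^{s} N_i\omega_i} \ge e^{(N^{1/2+\varepsilon})\beta}.$$

Obviously, a sufficiently small $\beta$ provides a minimum of (17) if the assumptions of Lemma 1 are
satisfied, which contradicts the assumption that the minimum in $\beta$ of the right-hand side
of (17) is equal to $\beta'$.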



We set $c = \Delta/N^{1+\alpha}$ in formula (21) after the substitution (23); then it is easy to see that
the relation

$$\frac{\mathcal{N}(\mathcal{M}\setminus\mathcal{A})}{\mathcal{N}(\mathcal{M})} \approx \frac{1}{N^m},$$

where $m$ is an arbitrary integer, holds for $\Delta = N^{3/4+\varepsilon}$. The proof of the theorem is
complete.

We have thus proved a cumulative formula in which the densities coincide in shape with the
Bose-Einstein distribution with negative temperature. The difference consists also in that,
instead of the set $\omega_n$ of random variables or eigenvalues of the Hamiltonian operator, the
formula contains some of their averages over the cells. In view of our theorem, the $\mathcal{E}_i$,
which are averages of the energy $\omega_k$ at the $i$th cell, are nonlinear averages in the sense of
Kolmogorov [7].

Let us number the signs constituting the dictionary in the order of increase of their
cardinality, beginning with the minimal cardinality $\omega_{\min}$. The signs that have the same
cardinality are ordered arbitrarily. The number of each sign in this ordering will be called
its rank and denoted by $r$. If $l$ is the number of the cardinality value $\omega_l$ (counting
from $\omega_{\min}$), then by $r_l$ we shall denote the number of all signs with cardinality less than
or equal to $\omega_l$. By $r_{-l}$ we shall denote the number of all signs with cardinality greater
than $\omega_l$, so that $r_l + r_{-l}$ is the total number of all signs.
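
The rank counts just defined can be computed directly from a list of cardinalities. The sketch below is a minimal illustration; the function name `ranks` and the sample prices are hypothetical.

```python
from bisect import bisect_right

def ranks(cardinalities):
    """Return (levels, r_plus, r_minus): for each distinct cardinality level w,
    r_plus counts the signs with cardinality <= w and r_minus those with
    cardinality > w, so that r_plus + r_minus is the total number of signs."""
    ordered = sorted(cardinalities)
    levels = sorted(set(ordered))
    total = len(ordered)
    r_plus = [bisect_right(ordered, w) for w in levels]
    r_minus = [total - r for r in r_plus]
    return levels, r_plus, r_minus

# Hypothetical cardinalities (prices) of seven signs.
levels, r_plus, r_minus = ranks([5, 3, 3, 8, 5, 5, 10])
print(levels, r_plus, r_minus)
```

Signs of equal cardinality are counted together here, which is consistent with their mutual order being arbitrary.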

Exactly as in the article [8], we see that the rank $r_l$ of the signs, ordered by increasing
cardinality, satisfies relations (3), (5), and (8) appearing in [8].

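Let us set

$$\tilde\omega_i = \omega_i\,(1 + \alpha\omega_i^{\gamma} + \alpha^{-1}\omega_i^{-\gamma});$$

then as $\beta \ll 1$

$$r_l = \frac{c_1}{1 + \alpha\omega_l^{\gamma}} + c_2; \qquad \omega_l = \Bigl(\frac{1}{\alpha}\,\frac{r_l}{r_{-l}}\Bigr)^{1/\gamma}.$$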

Figures 1 and 2 show how well the generators of the prices of American automobiles
estimate the demand. The first plot shows the dependence of the number of cars sold at a
price equal to or less than $\omega$ on the price; the second shows the dependence of the number
of the car in the "dictionary of cars" (this number can be regarded as the detailed make
of the car), taken in increasing order of car prices. The generators here are the people
who determined the price. The point of inflection of the graph corresponds to the price
level where the additional expenses are minimal. We see that this point is practically the
same on both plots. This means that the "agreement" between the generator and the
user in this case reaches a high level.
Figure 1: Number of cars with price $\le \omega$. The thin line represents the theoretical curve $r(\omega)$.
The mean quadratic error is $\sigma = 0.0188674$.